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■•. • DATE: 11/30/97 

INPUT SET: S21826.raw 

This Raw Listing contains the General H^" -A 

^rmtionS^ctionandup^^ -J 

SEQUENCE LISTING A/**, ' ^ 

General information: OT/^ 



1 

2 

3 (1) 
4 
5 
6 



MOCHIZUKI, Shin'ichi 1 CS J ~L4 / 

7 YANO, Kazuki / -sC. 

8 KOBAYASHI , Fumie / 

9 SHIMA , Nobuyuki 

10 YASUDA, Hisataka 

11 NAKAGAWA, Nobuaki 

12 MORINAGA, Tomonori 

1 3 UEDA, Masatsugu 

14 HIGASHIO, Kanji 

TITLE 0, I»™= »0V 61 - '» "° dUCln9 

the Proteins 



15 
16 
17 
18 
19 
20 
21 
22 
23 



(iii) NUMBER OF SEQUENCES: 108 



(B ) STREET: 125 High St, 
It (C) CITY: Boston 

" (D) STATE: MA 

„ (E) COUNTRY: USA 

27 ( F) ZIP : 02110 



28 



" (V) COMPUTER READABLE FORM: 

3 3 D) SOFTWARE: Patentln Release #i.u. 



34 
35 
36 
37 
38 

" B illim DATE; 20-EEB-1995 

4 3 v ' 

44 
45 
46 



(vi , CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



PAGE: 2 



47 
48 
49 



52 
53 



57 

58 

59 

60 

61 

62 

63 
64 
65 
66 
67 
68 
69 
70 
71 
72 
73 



79 
80 
81 
82 
83 



DATE: i 1/30/97 
TIME: 14:01:08 



INPUT SET: S21826.ruw 

(B) FILING DATE: 21-JUL-1995 



(Vii> ™«£SSSS"— PCT/^6/0037* 
H Jb) FILING DATE: 20-FEB-1996 



(Viii) ATTORNEY/ AGENT INFORMATION 

1 (A) NAME: CAMPBELL, Paula A. 

54 B REGISTRATION NUMBER: 32,503 

5 ! S REFERENCE/ DOCKET NUMBER: FJN-060 

56 * ' 



(ix) TELECOMMUNICATION INFORMATION . 
( ' (A) TELEPHONE : (617) 248-7000 
(B ) TELEFAX : (617) 248-7100 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



74 (ix) FEATURE : 

1S (A) NAME /KEY : Peptide 

S! SSTSSoLSio.. /note- i-terna! «1» «« 

?8 sequence of the protein)" 



( xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



Xaa Tyr His Phe Pro Lys 
84 1 5 

11 (2) INFORMATION FOR SEQ ID NO : 2 : 

11 (i) SEQUENCE CHARACTERISTICS : 

11 K ( A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



89 
90 
91 
92 
93 
94 
95 
96 

97 (ix) FEATURE 

98 
99 



(ii) MOLECULE TYPE: peptide 



(A) NAME /KEY : Peptide 

(B) LOCATION: 1..14 



PAGE: 3 



100 

101 

102 

103 

104 

105 

106 

107 

108 

109 

110 

111 

112 

113 

114 

115 

116 

117 

118 

119 

120 

121 

122 

123 

124 

125 

126 

127 

128 

129 

130 

131 

132 

133 

134 

135 

136 

137 

138 

139 

140 

141 

142 

143 

144 

145 

146 

147 

148 

149 

150 

151 

152 



INPUT SET: S21826.raw 
(D) OTHER INFORMATION: /note= "(an internal amino acid 
sequence of the protein)" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Gin His Ser Xaa Gin Glu Gin Thr Phe Gin Leu Xaa Lys 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..12 

(D) OTHER INFORMATION: /note= "(an internal ammo 
sequence of the protein)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
Xaa He Arg Phe Leu His Ser Phe Thr Met Tyr Lys 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..380 , 
(D) OTHER INFORMATION: /note- "(OCIF protein without 

signal peptide)" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp Glu Glu Thr Ser His 



# • 



196 
197 
198 



INPUT SET: S21826.raw 
10 I 5 



153 
154 

155 Gin Leu Leu i^ys i*=>jf * — — - 3Q 

156 
157 



Gin Leu Leu Cys Asp Lys Cys Pro Pro Gly Thr Tyr Leu Lys Gin His 



20 25 
3 Trp Lys Thr 

40 



Is! cys Thr Ala Lys Trp Lys Thr Val Cys Ala Pro Cys Pro Asp His Tyr 

159 35 40 

Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys Leu Tyr Cys Ser Pro 



160 

161 *jr- — ~ - go 

162 50 55 
163 



val Cys Lys Glu Leu Gin Tyr Val Lys Gin Glu Cys Asn Arg Thr His 



164 vax cys Lys mu 80 

165 65 70 75 

i 6 6 6 7 Asn Arg Val Cys Glu Cys Lys Glu Gly Arg Tyr Leu Glu lie Glu Phe 
168 " Q 



85 90 



1" Cys Leu Lys His Arg Ser Cys Pro Pro Gly Phe Gly Val Val Gin Ala 

17l 100 105 HO 

\]\ Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg Cys Pro Asp Gly Phe 

I74 115 120 I 25 

"I Phe Ser Asn Glu Thr Ser Ser Lys Ala Pro Cys Arg Lys His Thr Asn 

130 135 I 40 

Cys Ser Val Phe Gly Leu Leu Leu Thr Gin Lys Gly Asn Ala Thr His 
145 I 50 155 



177 130 135 140 

178 
179 
180 

L 182 Asp Asn lie Cys Ser Gly Asn Ser Glu Ser Thr Gin Lys Cys Gly He 

183 I 65 170 

\Ts Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr 

186 I 80 185 

III Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val Asp Asn Leu Pro Gly 

189 195 200 205 

Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie Lys Arg Gin His Ser 
192 210 215 220 

151 Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His Gin Asn 

195 225 230 " 235 

197 Lys Asp Gin Asp lie Val Lys Lys lie lie Gin Asp He Asp Leu Cys 

Thr 

260 265 270 



245 



250 



200 Glu Asn Ser Val Gin Arg His He Gly His Ala Asn Leu Thr Phe Glu 



201 
202 
203 

204 275 
205 



Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val Gly Ala 

280 285 



INPUT SET: S21826.raw 

206 Glu Asp lie Glu Lys Thr He Lys Ala Cys Lys Pro Ser Asp Gin lie 

207 290 295 300 

2S9 Leu Lys Leu Leu Ser Leu Trp Arg lie Lys Asn Gly Asp Gin Asp Thr 

210 305 310 315 

211 Leu Lys Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr His Phe 

325 330 



212 
213 

2^5 Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr lie Arg Phe Leu His 

340 345 



216 
217 
218 

219 355 
220 



253 
254 



Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe Leu Glu Met lie 

360 365 



2 20 

221 Gly Asn Gin Val Gin Ser Val Lys He Ser Cys Leu 

222 370 375 380 

22 3 

224 (2) INFORMATION FOR SEQ ID NO: 5: 
225 

226 (i) SEQUENCE CHARACTERISTICS: 

227 (A) LENGTH: 401 amino acids 

228 (B) TYPE: amino acid 

229 (C) STRANDEDNESS: 

230 ( D ) TOPOLOGY: linear 
231 

232 (ii) MOLECULE TYPE: protein 

233 

234 

2 35 (ix) FEATURE: 

2 36 (A) NAME /KEY : Protein 

237 ( B ) LOCATION: 1..380 . 

238 ( D ) OTHER INFORMATION: /note= "(OCIF protein) 

239 

240 (ix) FEATURE: 

241 (A) NAME/KEY: Peptide 

242 (B ) LOCATION: -21..0 , 

243 ( D ) OTHER INFORMATION: /note= "(signal peptide) 

244 
245 

246 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

HI Met Asn Asn Leu Leu Cys Cys Ala Leu Val Phe Leu Asp lie Ser lie 

249 "20 "15 

25? Lys Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp 

252 -5 1 5 

Glu Glu Thr Ser His Gin Leu Leu Cys Asp Lys Cys Pro Pro Gly Thr 

255 15 20 " 

Hi Tyr Leu Lys Gin His Cys Thr Ala Lys Trp Lys Thr Val Cys Ala Pro 

258 30 35 40 



page* l SEQUENCE VERIFICATION REPORT DATE: 1 1/30/97 

PAGE ' 1 PATENT APPLICATION US/08/915,004 TIME: 14:01:22 

INPUT SET: S21826.mw 



Line 



Error Original Text 



